(19) 



Europaisches Patentamt 
QSXi European Patent Office 



Office europeen des brevets 



(12) 



(11) EP 0 936 772 A2 
EUROPEAN PATENT APPLICATION 



(43) 


Date of publication: 


(51) IntCI 6 : H04L 1/00 




18.08.1999 Bulletin 1999/33 






(21) 


Application number: 99300741.8 






(22) 


Date of filing: 02.02.1999 






(84) 


Designated Contracting States: 


(72) 


Inventors: 




AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 




Sin ha, Deepen 




MC NL PT SE 




Chatham, New Jersey 07928 (US) 




Designated Extension States: 




Sundberg, Carl-Erik Wilhelm 




AL LT LV MK RO SI 




Chatham, New Jersey 07928 (US) 


(30) 


Priority: 11.02.1998 US 22114 


(74) 


Representative: 








Buckley, Christopher Simon Thirsk et al 


(71) 


Applicant: LUCENT TECHNOLOGIES INC. 




Lucent Technologies (UK) Ltd, 




Murray Hill, New Jersey 07974-0636 (US) 




5 Mornington Road 








Woodford Green, Essex IG8 0TU (GB) 



(54) Unequal error protection for perceptual audio coders 



(57) An audio information bit stream including audio 
control bits and audio data bits is processed for trans- 
mission in a communication system. The audio data bits 
are first separated into n classes based on error sensi- 
tivity, that is, the impact ot errors in particular audio data 
bits on perceived quality of an audio signal reconstruct- 
ed from the transmission. Each of the n different classes 
of audio data bits is then provided with a corresponding 
one of n different levels of error protection, where n is 
greater than or equal to two. The invention thereby 
matches error protection for the audio data bits to source 
and channel error sensitivity. The audio control bits may 
be transmitted independently of the audio data bits, us- 
ing an additional level of error protection higher than that 



used for any of the n classes of the audio data bits. Al- 
ternatively, the control bits may be combined with one 
of the n classes of audio data bits and provided with the 
highest of the n levels of error protection. Further pro- 
tection may be provided for the control bits by repeating 
at least a portion of the control bits from a current packet 
of the audio information bit stream in a subsequent 
packet of the audio information bit stream. Moreover, the 
classification of audio data bits into n different classes 
can be implemented on a fixed packet-by-packet basis, 
or in a more flexible, adaptive implementation in which 
different multipacket error protection profiles are used 
for different multipacket segments of a source-coded 
audio signal. 
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Description 

Field of the Invention 

[0001] The present invention relates generally to aii- s 
dio coding devices, and more particularly to techniques 
for providing unequal error protection (UEP) for different 
classes of bits encoded in a perceptual audio coder 
(PAC) or other audio source coding device. 

10 

Background of the Invention 

[0002] Most source coded bit streams exhibit unequal 
sensitivity to bit errors. For example, certain source bits 
can be much more sensitive to transmission errors than 15 
others. Moreover, errors in certain bits, such as control 
bits, may lead to severe error propagation and a corre- 
sponding degradation in reconstructed signal quality. 
Such error propagation can occur, for example, in the 
output audio bits of an audio coder due to the use of 20 
control bits for codebook information, frame size infor- 
mation, synchronization information and so on. The per- 
ceptual audio coder (PAC) described in D. Sinha, J.D. 
Johnston, S. Dorward and S.R Quackenbush, "The Per- 
ceptual Audio Coder,' in Digital Audio, Section 42, pp. 2s 
42-1 to 42-18, CRC Press, 1998, which is incorporated 
by reference herein, attempts to minimize the bit rate 
requirements for the storage and/or transmission of dig- 
ital audio data by the application of sophisticated hear- 
ing models and signal processing techniques. In the ab- 30 
sence of channel errors, a PAC is able to achieve near 
stereo compact disk (CD) audio quality at a rate of ap- 
proximately 128 kbps. At a lower bit rate of 96 kbps, the 
resulting quality is still fairly close to that of CD audio for 
many important types of audio material. 35 
[0003] The rale of 96 kbps is particularly attractive lor 
FM band transmission applications such as in -band dig- 
ital audio broadcasting (DAB) systems, which are also 
known as hybrid in-bandon-channel (HIBOC), all-digital 
IBOC and in-band adjacent channel (IBAC)/in-band re- 40 
serve channel (IBRC) DAB systems. There is also a sim- 
ilar effort underway to provide digital audio broadcasting 
at lower audio bit rates in the AM band. For these AM 
systems, audio bit rates of about 32 to 48 kbps are being 
considered for daytime transmission and about 1 6 kbps 45 
for nighttime transmission. Higher audio bit rates, great- 
er than about 1 28 kbps, are being used in multiple chan- 
nel DAB systems. The transmission channels in the 
above-noted DAB systems tend to be severely bandlim- 
ited and noise limited at the edge of a coverage area, so 
For mobile receivers, fading is also a severe problem. 
It is therefore particularly important in these and other 
applications to design an error protection technique that 
is closely matched to the error sensitivity of the various 
bits in the compressed audio bit stream. 55 
[0004] PACs and other audio coding devices incorpo- 
rating similar compression techniques are inherently 
packet-oriented, i.e., audio information for a fixed inter- 
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val (frame) of time is represented by a variable bit length 
packet. Each packet includes certain control information 
followed by a quantized spectral/subband description of 
the audio frame. For stereo signals, the packet may con- 
tain the spectral description of two or more audio chan- 
nels separately or differentially, as a center channel and 
side channels (e.g., a left channel and a right channel). 
Different portions of a given packet can therefore exhibit 
varying sensitivity to transmission errors. For example, 
corrupted control information leads to loss of synchro- 
nization and possible propagation of errors. On the other 
hand, the spectral components contain certain inter- 
frame and/or interchannel redundancy which can be ex- 
ploited in an error mitigation algorithm incorporated in a 
PAC codec. Even in the absence of such redundancy, 
the transmission errors in different audio components 
have varying perceptual implications. For example, loss 
of stereo separation is far less annoying to a listener 
than spectral distortion in the mid-frequency range in the 
center channel. 

[0005] Unequal error protection (UEP) techniques are 
designed to match error protection capability with sen- 
sitivity to transmission errors, such that the most impor- 
tant bits are provided with the highest level of error pro- 
tection, while less important bits are provided with a 
lesser level or levels of error protection. A conventional 
two-level UEP technique for use in DAB applications is 
described in N.S. Jayant and E.Y. Chen, "Audio Com- 
pression: Technology and Applications," AT&T Techni- 
cal Journal, pp. 23-34, Vol. 74, No. 2, March-April 1995. 
In this technique, which is based on a Reed-Solomon 
(RS) code, the control information is protected more ro- 
bustly since it is not possible to use error mitigation on 
the non-redundant control information. In fact, the prop- 
er operation of the error mitigation algorithm used in a 
PAC codec is itself dependent upon reliable control in- 
formation. All of the non-control spectral information in 
this technique is protected using a uniform level of error 
protection. 

Summary of the Invention 

[0006] We have realized that a significant problem 
with the above-described unequal error protection 
(UEP) technique and other conventional UEP tech- 
niques is that these techniques fail to exploit the unequal 
impact on perceived audio quality of transmission errors 
in various spectral components. Moreover, the conven- 
tional techniques are often not directly applicable to a 
wide variety of bit rates and to transmission channels 
other than radio broadcasting channels. 
[0007] The present invention provides methods and 
apparatus for implementing UEP in a source coded bit 
stream such as that generated by a perceptual audio 
coder (PAC). In an illustrative embodiment, an audio in- 
formation bit stream includes audio control bits and au- 
dio data bits. The data bits are first separated into n dif- 
ferent classes of bits, where n is greater than or equal 
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to two, based on the sensitivity of the bits to source and 
channel errors. Each of the n different classes of data 
bits is then encoded in accordance with a corresponding 
one of ndifferent levels of error protection. The invention 
thus separates audio data bits into classes based on the 
impact of errors in the data bits on perceived quality of 
an audio signal reconstructed from the transmitted en- 
coded audio, and matches the error protection provided 
for the audio data bits to error sensitivity. For example, 
the data bits may be separated such that data bits cor- 
responding to a side audio channel are assigned to a 
class which receives a lower level of error protection 
than data bits corresponding to a center audio channel. 
As another example, data bits corresponding to desig- 
nated frequency range, such as 100 Hz to 4 kHz, may 
be assigned to a class which receives a higher level of 
error protection than data bits corresponding to another 
frequency range. 

[0008] The audio control bits may be encoded inde- 
pendently of the audio data bits, using an additional level 
of error protection higher than that used for any of the n 
classes of the data bits. Alternatively, the control bits 
may be combined with one of the n classes of bits and 
encoded in accordance with the highest level of error 
protection of the n levels. Further protection may be pro- 
vided for the control bits by repeating at least a portion 
of the control bits from a current packet in the audio bit 
stream in a subsequent packet of the audio bit stream. 
The UEP techniques of the invention may be designed 
to provide the same total bit rate as an equal error pro- 
tection (EEP) technique. 

[0009] The invention is applicable not only to PACs 
but also to othertypes of audio compression techniques 
operating over a wide range of bit rates, and can be used 
with transmission channels other than radio broadcast- 
ing. The above -described classification of data bits into 
n different classes can be implemented on a fixed pack- 
et-by-packet basis, or in a more flexible, adaptive imple- 
mentation in which different multipacket error protection 
profiles are used for different multipacket segments of 
a source-coded audio signal. The invention can be im- 
plemented with a variety of different types of codes, in- 
cluding Reed-Solomon codes and rate-compatible 
punctured convolutional (RCPC) codes, as well as other 
types of block and convolutional codes. Moreover, the 
invention could use other techniques to provide different 
levels of error protection for different classes of bits, in- 
cluding, for example, coded modulation, higher level 
modem constellations and different transmission power 
levels, as well as other well-known techniques for im- 
plementing UEP. Communication systems with UEP in 
accordance with the invention exhibit a more graceful 
degradation in the presence of channel errors, as well 
as an extended operating range, relative to systems with 
conventional UEP or EEP systems. 



Brief Description of the Drawings 

[0010] FIG. 1 shows a simplified packet format for a 
bit stream generated by a perceptual audio coder (PAC). 

5 [0011] FIG. 2 is a block diagram of a communication 
system which implements three-level unequal error pro- 
tection (UEP) in accordance with the invention. 
[0012] FIGS. 3Aand3B illustrate further details of the 
PAC packet format of FIG. 1 . 

10 [0013] FIG. 4 shows a classification strategy for 
monophonic spectral information for use in classifying 
audio bits for the purpose of providing multi-level UEP 
in accordance with the invention. 
[0014] FIG. 5 illustrates one possible implementation 

is of a multipacket error protection profile technique in ac- 
cordance with the invention. 

[0015] FIGS. 6A and 6B illustrate techniques for im- 
plementing two-level UEP using a fixed block length and 
a fixed information word length, respectively, in accord- 

20 ance with the invention. 

[0016] FIG. 7 illustrates the use of rate-compatible 
punctured convolutional (RCPC) codes in providing 
two-level UEP in accordance with the invention. 
[0017] FIG. 8 illustrates the use of cyclic redundancy 

25 codes (CRC) in providing error mitigation in conjunction 
with a two-level UEP technique in accordance with the 
invention. 

Detailed Description of the Invention 

30 

[0018] The invention will be described below in con- 
junction with exemplary unequal error protection (UEP) 
techniques for use with a perceptual audio coder (PAC) 
such as that described in D. Sinha, J.D. Johnston, S. 

35 Dorward and S.R. Quackenbush, "The Perceptual Au- 
dio Coder," in Digital Audio, Section 42, pp. 42-1 to 
42-18, CRC Press, 1998. It should be understood, how- 
ever, that the UEP techniques of the invention may be 
applied to many other types of audio coding devices. In 

40 addition, the invention maybe utilized with a wide variety 
of different types of audio communication channels, in- 
cluding the Internet, cellular multimedia channels, sat- 
ellite channels, wireless cable, wireless local loop, high- 
speed wireless access and the like. Although illustrated 

45 in a system in which different codes are used to provide 
different levels of error protection, any of a number of 
alternative techniques can be used to provide unequal 
error protection for different classes of bits in accord- 
ance with the invention. These alternative techniques 

so include, for example, coded modulation techniques, 
higher level modem constellations and different trans- 
mission power levels. 

[0019] FIG. 1 illustrates an exemplary three-level 
UEP coding technique in accordance with the invention. 
55 First and second PAC audio packets /and /+1 are shown 
in FIG. 1 . The first PAC audio packet / includes a control 
bit portion 1 0j, a level 1 audio data bit portion 12, and a 
level 2 audio data bit portion 1 4,. The second PAC audio 
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packet h-1 includes a control bit portion 10 j+1 , a level 1 
audio data bit portion 12^ and a level 2 audio data bit 
portion 1 4 j+1 . The audio data portions are source encod- 
ed using Huffman codes. Ail other PAC packets in a giv- 
en audio information bit stream will be assumed to be 
arranged in the same manner. The control information 
in the control portions of the PAC audio packets gener- 
ally represents a relatively small portion of the overall 
bitstream, and normally incorporates no redundancy. As 
will be described in greater detail below, the control in- 
formation may be provided with a higher level of error 
protection in part by repeating the control information 
from one packet in one or more other packets. 
[0020] The exemplary three-level UEP coding tech- 
nique illustrated in FIG. 1 utilizes a first code for the con- 
trol portions of the PAC audio packets (i.e., class 1 bits), 
a second code for the level 1 audio data portions (i.e., 
class 2 bits), and a third code for the level 2 audio data 
portions (i.e., class 3 bits). The class 1 bits are the most 
important bits and are therefore protected with a low- 
rate (i.e., high redundancy) code, while the class 2 and 
class 3 bits are of increasingly lesser importance and 
are therefore protected with increasingly higher-rate (i. 
e., lower redundancy) codes. In the illustrative embodi- 
ment of FIG. 1, the control information is thus treated 
separately from the audio information, and is provided 
with the highest level of error protection, while the audio 
data is separated into two distinct classes of bits, each 
of which is provided with a different level of error protec- 
tion. 

[0021] The relative allocation of audio bits into one of 
the two audio data classes 2 and 3 and the correspond- 
ing rates of the respective second and third error pro- 
tection codes are collectively referred to as the error pro- 
tection profile of the UEP technique. Error protection 
profiles in accordance with the invention may be con- 
strained to ensure that the overall bit rate is substantially 
equivalent to that provided by a conventional equal error 
protection (EEP) technique. In one possible implemen- 
tation of the above-described three-level UEP tech- 
nique, the second and third codes may be fixed-rate er- 
ror protection codes. This will generally mean that the 
resulting UEP technique can provide substantially the 
same average source and channel rates as a corre- 
sponding conventional EEP technique. In addition, the 
relative allocation of audio data bits between the second 
and third classes may be the same for all packets. 
[0022] FIG. 2 is a blockdiagram of an exemplary com- 
munication system 20 which implements the above-de- 
scribed three-level UEP technique in accordance with 
the invention. The system 20 includes a transmitter 22 
and a receiver 24 which communicate over a transmis- 
sion channel 26. The transmitter 22 includes a PAC en- 
coder 30 for generating a sequence of PAC audio pack- 
ets from an analog audio signal. Although this embodi- 
ment uses PAC audio packets, the invention is more 
generally applicable to audio information generated by 
any type of audio compression technique. The audio 



packets from PAC encoder 30 are applied to a classifier 
32 which converts the packets into separate bit streams 
corresponding to control information, level 1 audio data 
bits and level 2 audio data bits. The control information 

5 bit stream is supplied to a first channel coder 33, the 
level 1 audio data bit stream is supplied to a second 
channel coder 34, and the level 2 audio data bit stream 
is supplied to a third channel coder 35, as shown in FIG. 
2. The symbol outputs of the channel coders 33, 34 and 

to 35 are supplied to an interleaver 36 which provides in- 
terleaving of the symbols over multiple PAC packets. 
The transmitter 22 may include additional processing el- 
ements, such as a modulator, a multiplexer, an upcon- 
verter and the like, which are not shown in FIG. 2 for 

is simplicity of illustration. For example, the output of the 
interleaver 36 may be modulated onto one or more sub- 
carriers, and the sub-carriers frequency modulated onto 
a radio frequency (RF) carrier for transmission to the re- 
ceiver 24. As another example, the output of the inter- 

20 leaver 36 may be applied 1o a modem for modulation 
onto carriers for transmission over a telephone line or 
other network connection. 

[0023] The receiver 24 receives the transmitted signal 
• from the transmission channel 26, and processes it to 

25 recover the interleaved symbols. The symbols are dein- 
terleaved in a deinterleaver 40, and then applied to first, 
second and third channel decoders 41, 42 and 43, re- 
spectively, as shown in FIG. 2. The bit stream at the out- 
put of the first decoder 41 corresponds to the control 

30 information, the bit stream at the output of the second 
decoder 42 corresponds to the level 1 audio data bit 
stream, and the bit stream at the output of the third de- 
coder 43 corresponds to the level 2 audio data bit 
stream. The audio data bit streams are then processed 

35 in respective error mitigation units 44, 46, and the re- 
sulting processed audio data bit streams are applied 
with the control information bit stream from decoder 41 
to a reconstruction unit 47. The reconstruction unit 47 
processes the three bit streams applied thereto to re- 

40 construct PAC packets of the type shown in FIG. 1 . The 
resulting sequence of PAC packets are then decoded in 
a PAC audio decoder 48 to reconstruct the original an- 
alog audio signal. 

[0024] Like the transmitter 22, the receiver 24 may in- 
45 elude additional processing elements which are not 
shown in FIG. 2. It should also be noted that various 
elements of the system 20, such as the interleaver 36 
and the deinterleaver 40, may be eliminated in alterna- 
tive embodiments. Moreover, various elements of the 
so system 20, such as the audio encoder and decoder, the 
channel coders and decoders, the classifier and recon- 
struction unit, and the error mitigation units, may be im- 
plemented using portions of an application-specific in- 
tegrated circuit, microprocessor or any other type of dig- 
55 jtal data processor. Various aspects of the invention may 
also be implemented in the form of one or more software 
programs executed by a central processing unit (CPU) 
or the like in the digital data processor. 
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[0025] A simplified two-level version of the three-level 
UEP technique described in conjunction with FIGS. 1 
and 2 combines the control information of class 1 with 
the level 1 audio data of class 2 to form a new class, 
referred to as class 1 *, which is protected by a single s 
error protection code. The subset of class 1 * which cor- 
responds to the control information of original class 1 
may be further protected using the above-noted inter- 
packet repeat feature. For example, the control informa- 
tion for a current PAC packet may be included in both 
the current packet and the next packet. The error cor- 
rection capability of an embodiment with this type of re- 
peat code may be further enhanced by the use of an 
interleaver which operates on the coded bitstream to en- 
sure that the repeated control information is well sepa- 
rated in time from the original control information. The 
simplified two-level version of the UEP technique may . 
be viewed as operating on a two-class partitioned bit 
stream in which each packet is effectively partitioned in- 
to two classes: the above-defined class 1* and a class 
2* which corresponds to the original class 3. Suitable 
channel codes for use with class 1* and class 2* will be 
described in greater detail below. 
[0026] The quantization model used in a PAC encoder 
is essentially an adaptive subband coding scheme. An 
analog audio signal is typically processed in the encoder 
by a time-varying filter bank. The output from the filter 
bank is partitioned into groups or "coder bands" such 
that each coder band has a single quantizer step size 
or "scale factor" associated with it This partitioning is 
based on a "critical band" division of the audio frequency 
range. Critical bands are ranges of frequency over 
which the ear tends to integrate audio stimuli. Usually 
there are one or more coder bands associated with each 
critical band which indicate finer partitioning of the crit- 
ical band audio information in frequency (i.e., if a high 
frequency resolution filter bank is employed) or in time 
(i.e., if a higher time resolution filter bank is employed). 
The scale factors associated with the coder bands are 
generated in each audio frame using a perceptual mask- 
ing model. The scale factors. and the quantized coder 
band components are further compressed using an 
adaptive Huffman coding scheme. Usually one of sev- 
eral possible Huffman codebooks may be selected for 
a given group of coder bands. As previously noted, the 
use of Huffman coding can cause transmission errors 
to be propagated within an audio frame. 
[0027] FIGS. 3A and 3B illustrate an exemplary PAC 
packet in greater detail. FIG. 3A shows the format o1 a 
single exemplary packet, while FIG. 3B shows the for- 
mat of a left channel or right channel data portion of the 
packet of FIG. 3A. PAC coding utilizes a block process- 
ing algorithm and each PAC packet corresponds to 1 024 
input samples from each audio channel, regardless of 
the actual number of audio channels. Each packet con- 
tains the Huffman-encoded filter bank outputs, code- 
book sections, quantizers and channel combination in- 
formation for one 1 024-sample chunk or eight 1 28-sam- 



ple chunks. The packet size is variable, such that pack- 
ets corresponding to different sets of 1024 input sam- 
ples can have different sizes. Depending on the appli- 
cation, various types of additional control information 
are added to a first packet or to every packet. For ex- 
ample, when storing information on a reliable medium, 
such as a hard disk, a header indicating version, sample 
rate, number of channels, and encoded rate is placed 
at the beginning of the compressed music. For poten- 
tially unreliable transmission channels, such as those 
used in digital audio broadcasting, a header is generally 
added to each packet. This header can contains control 
information such as synchronization, error recovery, 
sample rate, number of audio channels, and the trans- 
mission bit rate. 

[0028] In the exemplary PAC packet of FIG. 3A, a 
header portion of the packet includes a 32-bit sync, an 
8-bit bitrate indicator, 4-bit I and O sample rate indica- 
tors, 13-bit block length, half block length, next block 
length and next half block length indicators, a 4-bit buffer 
state indicator, a 6-bit ancillary data length indicator, and 
a 10-bit auxiliary data length indicator. This header is 
followed by a variable length left channel data portion, 
a variable length right channel data portion, and variable 
length ancillary and auxiliary data portions. FIG. 3B 
shows that the left or right channel data portions may 
include sectioning information (e.g. , number of sections, 
section boundaries and codebooks), scale factors and 
coefficients. Additional details regarding the PAC packet 
format shown in FIGS. 3A and 3B can be found in the 
above-cited PAC reference and the format will therefore 
not be further described herein. 

[0029] The process of classifying a PAC audio bit- 
stream into unequal error sensitivity classes will now be 
described in greater detail. In the simplified two-level 
version of the three-level embodiment described in con- 
junction with FIGS. 1 and 2 above, the relative sizes of 
class 1* and class 2* may be maintained substantially 
constant from packet to packet, i.e., a fixed P% of bits 
in each packet are classified as critical The operating 
characteristics of the classifier which allocates bits to 
the two classes, together with the parameter P, deter- 
mine the relative rates of the two classes, and may be 
selected based on perceptual masking principles and 
subjective quality measurements. Two illustrative per- 
ceptual masking principles which may be used in the 
classifier are as follows: (1 ) the stereo separation of the 
signal, i.e., side channels are deemed to perceptually 
less significant than the center channel: and (ii) a group 
of coder bands which correspond to a frequency range 
or set of frequency ranges which have been determined 
to be perceptually more critical to the quality of the re- 
constructed audio. Typically, the low to mid frequency 
coder bands, corresponding generally to the frequency 
range of about 100 Hz to 4 kHz, are more critical than 
the rest of the spectral information in the audio bit- 
stream. Experiments indicate that a relatively large val- 
ue of P is often desirable, due to the fact that a relatively 
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large number of bits are consumed by the low to mid 
frequency coder bands (as much as 60-70% for certain 
packets). In many applications, a value of P of about 
50% will be acceptable for use. 

[0030] The operation of a classifier configured to uti- 5 
lize the above principles will now be described. For each 
packet with a size K bits, a "bit bucket" of size P*K/100 
is created. The bit bucket is then filled with bits from the 
packet by applying a sequence of perceptually-driven 
selection rules. First, the control information is placed in 
the bucket. Next, the spectral information corresponding 
to the frequency range 1 00 Hz to 2.5 kHz for the mono- 
phonic or center channel is placed in the bucket. For 
coder bands utilizing Left/Right coding, bits for both the 
left and right channels are placed in the bucket, but for 
coder bands utilizing Sum/Diff coding, only the sum bits 
are included in the bucket. Next, all or part of the mono- 
phonic or center channel information corresponding to 
the frequency range 2.5 kHz to 4 kHz goes into the buck- 
et. Since it is often not possible to incorporate all of these 
bits into the bucket, a sub-critical band comb may be 
applied to the spectral bits to select certain pairs of these 
bits for placement in the bucket. If the bucket is still not 
full after application ol the band comb, bits are chosen 
for inclusion in the bucket in the following order: (1 ) dif- 
ference channel bits in the 100 Hz to 4 kHz range, (2) 
channel bits in the 0 to 100 Hz range, (3) monophonic 
or center channel bits in the 4 to 8 kHz range, (4) differ- 
ence channel bits in the 4 to 8 kHz range, and (5) higher 
frequency spectral information greater than 8 kHz. 
[0031] The bits which are placed in the above-de- 
scribed bit bucket are the class 1* bits, and therefore 
represent the more critical audio information which is 
provided with a higher level of error protection. The re- 
mainder of the bits in the audio packet form the class 2* 
bits for the packet and are provided with a lower level of 
error protection. Specific channel codes suitable for use 
with these two classes of bits will be described in greater 
detail below. The different levels of error protection are 
generally selected such that overall distortion in the 
quality of the reconstructed audio is minimized. This de- 
termination can be made subjectively, by listening to the 
impact of simulated errors and adjusting the error pro- 
tection levels accordingly. Objective distortion criteria 
could also be used but may in some cases produce sub- 
optimal results. 

[0032] FIG. 4 illustrates the operation of the classifier 
for monophonic or center channel spectral information 
in one possible implementation. The classifier for the 
ce '-er channel spectral information is shown as a fre- 
q cy plot designating which frequency ranges are as- 
Sic ed to class 1* (shown by a solid line) and which are 
assigned to class 2* (shown by a dashed line). Other 
embodiments of the invention can of course utilize other 
types of classifiers. 

[0033] The above-described three-level UEP tech- 
nic;.!? can also be extended to provide more sophisticat- 
ed types of protection. For example, the audio data may 



be classified into more than two classes of error sensi- 
tivity, with the respective sizes and code rates of the 
classes adjusted appropriately. The error protection pro- 
file may still be made constant from packet to packet. 
Such further subclassification of the two classes of au- 
dio data may be used to better exploit the error mitiga- 
tion algorithms. For example, PAC uses an interpolative 
error mitigation algorithm in which information from past 
and future audio packets is utilized to generate any 
missing packets. Some parts of the spectral audio de- 
scription used in PAC yield better interpolation than oth- 
ers and this fact may be utilized to classify the informa- 
tion in different error protection classes. 
[0034] FIG. 5 illustrates another possible extension of 
the above-described UEP techniques. This extension 
utilizes a multi-band audio signal loudness profile over 
a multiple audio frame duration to determine an appro- 
priate multipacket error protection profile for the corre- 
sponding group of multiple PAC audio packets. This pro- 
file, which may be selected from a finite number of pos- 
sible profiles, will then be transmitted as part of the for- 
ward control information to the receiver. FIG. 5 shows a 
portion of a transmitter 50 in which a source-coded au- 
dio information bit stream is applied to an error sensitiv- 
ity classifier 52. The classifier 52 operates on a segment 
of multiple PAC packets as shown. Each of the PAC 
packets in the multipacket segment corresponds to a 
single audio frame. The classifier 52 separates the au- 
dio data bits in each of the packets of the multipacket 
segment into n different classes of audio data bits based 
on error sensitivity, in a manner similar to that previously 
described. The audio data bits from each of these class- 
es are supplied via lines 54 to an encoder 56 which in- 
cludes individual forward error correction (FEC) encod- 
ers 57 for each of the classes of audio data bits. Each 
of the individual encoders 57 provides a different level 
of error protection for the class of audio data bits applied 
thereto. The control bits from the audio bit stream may 
be combined with the most error sensitive class of the 
audio data bits and delivered via one of the lines 54 to 
one of the encoders 57. Alternatively, a separate one of 
the lines 54 and encoders 57 may be used to encode 
the control bits. 

[0035] An identifier of the particular error protection 
profile selected for the multipacket segment is supplied 
from the classifier 52 via line 58 to another encoder 59. 
This identifier specifies the particular error protection 
profile utilized for the corresponding multipacket seg- 
ment, and is therefore protected with a relatively high 
level of error protection. The transmitter 50 can thus be 
used to select a different UEP profile for different multi- 
packet segments of an audio information bit stream, 
such that the selected profile may be varied in accord- 
ance with the characteristics of the audio signal. In an 
alternative embodiment, the control bits from the input 
audio bit stream may be combined with the UEP profile 
identifier and coded by the encoder 59, rather than being 
combined with one of the classes of audio data bits or 
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separately applied to one of the individual encoders 57. 
The outputs of the individual encoders 57, 59 of encoder 
56 are applied to a multiplexer, modulator or other de- 
vice for further processing prior to transmission. 
[0036] The use of a multipacket error protection pro- 
file as described in conjunction with FIG. 5 is particularly 
attractive for PAC applications operating at lower bit 
rates and lower audio bandwidths. For example, a 32 
kbps PAC output attempts to code only 7-8 kHz of audio 
bandwidth. In this case, a majority of output bits in sev- 
eral packets may correspond to critical audio informa- 
tion and a 50-50 split between two classes for each 
packet may not yield any significant quality advantage. 
On the other hand, a multipacket error protection profile 
will be able to exploit the variability in the bit demand as 
well as the interpolative capabilities of the PAC error mit- 
igation algorithm to classify a suitable multipacket sub- 
set of the critical information as a separate class. Pro- 
tecting this multipacket critical class with a stronger level 
of error protection is then likely to provide improved per- 
formance relative to a UEP technique which attempts to 
separate packets into classes of bits using an identical 
profile for every packet. 

[0037] Another application in which a multipacket er- 
ror protection profile can be particularly useful is in high 
fidelity reproduction of onsets in an audio signal. This is 
an important quality criterion for an audio compression/ 
transmission scheme. Such onsets, although infre- 
quent, require a large number of bits for encoding. Fur- 
thermore, for onsets, higher frequency audio compo- 
nents should be classified as critical as well. It is there- 
fore apparent that for a PAC audio packet containing an 
onset, a simple partition into two equal-size classes in 
accordance with a single-packet error profile may not 
provide optimal results. On the other hand, a multipack- 
et error profile may be configured to allow most of the 
bits from an onset PAC packet to be classified as critical. 
Although this will generally imply that fewer bits from 
other packets in the same multipacket segment can be 
classified as critical, there may be less demanding audio 
packets in the segment which can benefit from other fea- 
tures such as interpolation. 

[0038] FIGS. 6A and 6B illustrate two possible tech- 
niques of employing block codes to provide two-level 
UEP in accordance with the invention. The block codes 
may be, for example, Reed-Solomon codes, as de- 
scribed in greater detail in S. Lin and D.J. Costello Jr., 
"Error Control Coding: Fundamentals and Applications, 
" Prentice-Hall, 1983, which is incorporated by refer- 
ence herein. In this case, two different Reed-Solomon 
codes may be used to provide two levels of error pro- 
tection, with the average rate of the two codes equiva- 
lent to the rate which would be used to provide EEP for 
both classes. FIG. 6A illustrates an implementation 
which utilizes code words of a fixed length m. A code- 
word 100 configured to provide EEP for two different 
classes includes a first portion of length /cand a second 
portion of length m-k, where k\s the number of informa- 



tion symbols. An exemplary pair of codewords 102 and 
104 are configured in accordance with a two-level UEP 
technique such as that described previously. Codeword 
102 includes a first portion of length k y and a second 
5 portion of length m-k v while codeword 104 has a first 
portion of length and a second portion of length m- 
k 2 . In this example, the number of information symbols 
/t, in codeword 102 is less than m, while the number of 
information symbols k 2 in codeword 1 04 is greater than 

[0039] FIG. 6B illustrates an implementation which 
utilizes variable length code words but a fixed number 
of information symbols k. A codeword 1 06 configured to 
provide EEP for two different classes of bits includes a 

is first portion of length k and a second portion of length 
m-k, as in the codeword 100 of FIG. 6A. FIG. 6B also 
shows an exemplary pair of codewords 108 and 110 
configured in accordance with a two-level UEP tech- 
nique. Codeword 108 includes a first portion of length k 

20 and a second portion of length -k, while codeword 
110 has a first portion of length k, and a second portion 
of length nr^-k. In this example, the length m, of code- 
word 108 is greater than k, while the length m 2 of code- 
word 1 1 0 is less than k. Shortened forms may be used 

25 for the Reed-Solomon codes in this embodiment. Inter- 
leaving should be performed at the code symbol level. 
In an embodiment using Reed-Solomon codes, an error 
detection flag for use in error mitigation can be obtained 
automatically from a decoder. 

30 [0040] Othertypes of codes could also be used to pro- 
vide multi-level UEP in accordance with the invention, 
including convolutional codes, as described in, for ex- 
ample, the above-cited S. Lin and D.J. Costello Jr. ref- 
erence, and G.C. Clark, Jr. and J.B. Cain, "Error Cor- 

35 recting Codes for Digital Communications," Plenum 
Press, 1981, which is also incorporated by. reference 
herein. A particularly efficient and flexible technique for 
achieving multi-level UEP with convolutional codes is 
through the use of rate-compatible punctured convolu- 

40 tional (RCPC) codes, as described in, for example, J. 
Hagenauer, "Rate Compatible Punctured Convolutional 
Codes (RCPC Codes) and their Applications," IEEE 
Transactions on Communications, Vol. 36, No. 4, pp. 
389-400, April 1988, J. Hagenauer et al., "The Perform- 

45 ance of Rate Compatible Punctured Convolutional 
Codes for Digital Mobile Radio," IEEE Transactions on 
Communications, Vol. 38, No. 7, pp. 966-980, July 1990, 
and RV. Cox et al., "Sub-band Speech Coding and 
Matched Convolutional Channel Coding for Mobile Ra- 

50 dio Channels," IEEE Transactions on Acoustics, 
Speech and Signal Processing, Vol. 39, No. 8, pp. 
1717-1731, August 1991, all of which are incorporated 
by reference herein. 

[0041] FIG. 7 illustrates the use of termination bits for 
55 use in conjunction with RCPC codes or other types of 
convolutional codes to provide two-level UEP in accord- 
ance with the invention. An exemplary codeword 120 
includes a first level portion 122, a second level portion 
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124 and a single terminating tail 126. If the overhead 
due to the bits of the terminating tail 126 is negligible, 
the need for rate-compatible codes for the first and sec- 
ond levels is less urgent, and non-rate-compatible 
codes could be used. It is also possible to terminate s 
each level code separately, as illustrated in the code- 
words 130 and 132 of FIG. 7. The codeword 130 in- 
cludes a first level portion 1 33 and a terminating tail 1 34, 
while the codeword 132 includes a second level portion 
and a separate terminating tail 1 36. The additional over- 10 
head for separate terminating tails in two-level UEP may 
still be negligible in many applications. The use of punc- 
tured convolutional codes allows the same basic decod- 
er to be used for all codes derived from the same mother 
code. 15 
[0042] As noted above, the use of Reed-Solomon 
codes permit an error detection flag for error mitigation 
to be obtained automatically from the decoder. In em- 
bodiments of the invention which make use of convolu- 
tional codes, a number of different techniques may be zo 
used to obtain an error detection flag for error mitigation. 
One possible technique is to utilize a generalized Viterbi 
algorithm, as described in, for example, N. Seshadri and 
C-E. W. Sundberg, "Generalized Viterbi Algorithms lor 
Error Detection with Convolutional Codes," Conf. Rec. 25 
GLOBECOM '89, pp. 1534-1538, Dallas, Texas, No- 
vember 1989, N. Seshadri and C-E. W. Sundberg, "List 
Viterbi Decoding Algorithms with Applications," IEEE 
Transactions on Communications, pp. 31 3-323, Vol. 42, 
No. 2/3/4, 1 994, and C. Nill and C-E. W. Sundberg, "List so 
and Soft Symbol Output Viterbi Algorithms: Extensions 
and Comparisons," IEEE Transactions on Communica- 
tions, Vol. 43, No. 2/3/4, 1995, all of which are incorpo- 
rated by reference herein. This technique generally re- 
quires no significant additional overhead beyond that al- 35 
ready used in the convolutional code, but may require 
more frequent use of termination tails and more complex 
processing. 

[0043] FIG. 8 illustrates a second possible technique 
for obtaining an error detection flag for use in error mit- *° 
igation. This technique involves using a cyclic redun- 
dancy code (CRC) before the convolutional code. In ac- 
cordance with this technique, CRC bits are inserted pe- 
riodically in the PAC bitstream, and implemented sepa- 
rately for each level of error protection. As illustrated in 45 
FIG. 8, a set of first level bits 140 in the PAC audio bit 
stream includes groups of first level bits 142 inter- 
spersed with corresponding CRC bits 142. Similarly, a 
set of second level bits 150 in the PAC audio stream 
includes groups of second level bits interspersed with so 
corresponding CRC bits 154. This CRC technique has 
the advantage of being simple to implement and use. A 
third possible technique for obtaining an error detection 
flag for use in error mitigation is to use a high-rate Reed- 
Solomon code after the convolutional code. It should be 55 
noted that alternative techniques for generating an error 
detection flag, other than the three exemplary tech- 
niques described above, may also be used. 



[0044] The above-described embodiments of the in- 
vention are intended to be illustrative only. For example, 
alternative embodiments of the invention may utilize 
block or convolutional codes other than the exemplary 
Reed-Solomon or RCPC codes described above. More- 
over, techniques other than coding may be used to pro- 
vide different levels of error protection. In addition, the 
techniques of the invention may be used to provide any. 
number of different levels of UEP for audio information, 
arid may be used with a wide variety of different bit rates 
and transmission channels. Alternative embodiments 
can extend the illustrative two-level and three-level 
techniques described above to any desired number of 
levels in a straightforward manner. Other alternative em- 
bodiments can utilize any number of different multipack- 
et error protection profiles for processing segments of 
multiple audio packets. Further embodiments could use 
other techniques for providing an adaptive number of 
different levels of error protection in a given application. 
These and numerous other alternative embodiments 
and implementations within the scope of the following 
claims will be apparent to those skilled in the art. 



Claims 

1 . A method of processing a stream of audio informa- 
tion bits for transmission in a communication sys- 
tem, the audio information bits including control bits 
and data bits, the method including the steps of: 

separating the audio data bits into n different 
classes of audio data bits based on error sen- 
sitivity, where n is greater than or equal to two; 
and 

providing a different level of error protection for 
each of the n different classes of audio data 
bits. 

2. A method as claimed in claim 1 wherein the provid- 
ing step includes coding the control bits using a lev- 
el of error protection higher than that used for any 
of the n classes of the data bits. 

3. A method as claimed in claim 1 wherein the sepa- 
rating step includes separating the audio data bits 
into the n classes of data bits based on the impact 
of the data bits on perceived quality of an audio sig- 
nal reconstructed from the audio information bits. 

4. A method as claimed in claim 3 wherein the sepa- 
rating step includes separating the data bits such 
that data bits corresponding to a side audio channel 
are assigned to a class with a lower level of error 
protection than that of data bits corresponding to a 
center audio channel. 

5. A method as claimed in claim 3 wherein the sepa- 
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rating step includes separating the data bits such 
that data bits corresponding to a designated fre- 
quency range are assigned to a class with a higher 
level of error protection than that of data bits corre- 
sponding to another frequency range. 

6. Apparatus for use in processing audio information 
for transmission in a communication system, the 
apparatus comprising: 

a classifier for receiving a stream of audio in- 
formation bits, wherein the classifier is opera- 
tive to separate the audio information bits into 
control bits and n different classes of data bits, 
where n is greater than or equal to two, and the 
data bits are separated into the classes based 
on error sensitivity of the data bits; and 
at least n encoders, each of the n encoders re- 
ceiving the data bits of one of the n classes, and 
coding the nth class of data bits so as to provide 
a corresponding one of n levels of error protec- 
tion for that class of data bits. 

7. A method as claimed in claim 1 or apparatus as 
claimed in claim 6 wherein the n different levels of 
error protection include a first level of error protec- 
tion for a first class of the data bits and a second 
level of error protection for a second class of the 
data bits. 

8. A method as claimed in claim 7 wherein the provid- 
ing step includes coding the control and data bits 
such that the control bits and the first class of data 
bits are both provided with the first level of error pro- 
tection. 

9. Apparatus as claimed in claim 6 including an addi- 
tional encoder for encoding the control bits using a 
level of error protection higher than that used for any 
of the n classes of the data bits. 

10. Apparatus as claimed in claim 6 wherein one of the 
encoders is operative to encode the control bits and 
a given one of the classes of data bits such that both 
the control bits and the given class of data bits are 
both provided with the same level of error protec- 
tion. 

11. A method as claimed in claim 1 or apparatus as 
claimed in claim 6 wherein at least a portion of the 
control bits from a current packet in the audio infor- 
mation bit stream are repeated in a subsequent 
packet of the audio information bit stream. 

12. Apparatus as claimed in claim 6 wherein the clas- 
sifier is operative to separate the data bits into the 
n classes of data bits based on the impact of the 
data bits on perceived quality of an audio signal re- 



16 

constructed from the audio information bits. 

1 3. Apparatus as claimed in claim 1 2 wherein the clas- 
sifier separates the data bits such that data bits cor- 
s responding to a side audio channel are assigned to 
a class with a lower level of error protection than 
that of data bits corresponding to a center audio 
channel. 

10 14. Apparatus as claimed in claim 12 wherein the clas- 
sifier separates the data bits such that data bits cor- 
responding to designated frequency range are as- 
signed to a class with a higher level of error protec- 
tion than that of data bits corresponding to another 

15 frequency range. 

15. A method as claimed in claim 5 or apparatus as 
claimed in claim 14 wherein the designated fre- 
quency range is approximately 100 Hz to 4kHz. 

20 

16. A method as claimed in claim 1 or apparatus as 
claimed in claim 6 wherein a packet of the stream 
of audio information bits has a length of K bits, and 
a first one of the n classes having a highest level of 

25 error protection is approximately of size P*K/100, 
where P is a fixed percentage of bits which will be 
assigned to the first class for multiple packets of the 
audio information bit stream. 

30 17. Apparatus for use in decoding audio information re- 
ceived in a communication system, the apparatus 
including: 

at least n decoders, wherein each of the n de- 
35 coders is operative to decode a portion of a re- 

ceived audio signal to recover a corresponding 
portion of an audio information bit stream in- 
cluding control bits and n different classes of 
data bits, where n is greater than or equal to 
40 two, and each of the n classes of data bits has 

one of n levels of error protection; and 
a reconstruction unit which receives the por- 
tions of the audio information bit stream from 
the decoders and generates audio packets 
45 therefrom. 

18. Apparatus as claimed in claim 17 including an ad- 
ditional decoder for decoding a portion of the re- 
ceived signal corresponding to the control bits, 

50 wherein the control bits have a level of error protec- 
tion higher than that used for any of the n classes 
of the data bits. 

19. Apparatus as claimed in claim 17 wherein one of 
55 the n decoders is operative to decode a portion of 

the received signal corresponding to the control bits 
and a portion of the received signal corresponding 
to a given one of the classes of data bits, wherein 
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both the control bits and the given class of data bits 
both have the same (evel of error protection. 
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bits on perceived quality of an audio signal reconstruct- 
ed from the transmission. Each of the n different classes 
of audio data bits is then provided with a corresponding 
one of n different levels of error protection, where n is 
greater than or equal to two. The invention thereby 
matches error protection for the audio data bits to source 
and channel error sensitivity. The audio control bits may 
be transmitted independently of the audio data bits, us- 
ing an additional level of error protection higher than that 
used for any of the n classes of the audio data bits. Al- 



ternatively, the control bits may be combined with one 
of the n classes of audio data bits and provided with the 
highest of the n levels of error protection. Further pro- 
tection may be provided for the control bits by repeating 
at least a portion of the control bits from a current packet 
of the audio information bit stream in a subsequent 
packet of the audio information bit stream. Moreover, the 
classification of audio data bits into n different classes 
can be implemented on a fixed packet-by-packet basis, 
or in a more flexible, adaptive implementation in which 
different multipacket error protection profiles are used 
for different multipacket segments of a source-coded 
audio signal. 
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